Writer Identification through Information Retrieval: The Allograph Weight Vector
نویسندگان
چکیده
We show a number of promising results in writer identification, by recasting the traditional information retrieval (IR) problem of finding documents based on the frequency of occurrence of their terms. In IR, the tf-idf is a well-known statistical measure that weighs the importance of certain terms occurring in a database of documents. Here, writers are searched on the basis of the frequency of occurrence of particular character shapes: the allographs. The results show a high retrieval score. Moreover, by using the af-iwf (allograph frequency inverse writer frequency) measure, qualitative and quantitative analyses can be made that elaborate on the particular allograph shapes that lead to a successful writer identification. In this paper, we sketch the application of these techniques in forensic science.
منابع مشابه
Handwriting Analysis based on Segmentation Method for Prediction of Human Personality using Support Vector Machine
Handwriting analysis is a method to predict personality of an author and to better understand the writer. Allograph and allograph combination analysis is a scientific method of writer identification and evaluating the behavior. To make this computerized we considered six main different types of features: (i) size of letters, (ii) slant of letters and words, (iii) baseline, (iv) pen pressure, (v...
متن کاملAllograph Based Writer Adaptation for Handwritten Character Recognition
Writer adaptation is the process of converting a generic (writer-independent) handwriting recognizer into a personalized (writer-dependent) recognizer with improved accuracy for a particular user. While training the generic recognizer uses large amounts of data from several writers, the adaptation process uses only a few samples from a single user. In this paper we present a) an automatic appro...
متن کاملWriter Identification and Retrieval Using a Convolutional Neural Network
In this paper a novel method for writer identification and retrieval is presented. Writer identification is the process of finding the author of a specific document by comparing it to documents in a database where writers are known, whereas retrieval is the task of finding similar handwritings or all documents of a specific writer. The method presented is using Convolutional Neural Networks (CN...
متن کاملInformation Retrieval Based Writer Identification
In this paper, we apply an Information Retrieval model for the writer identification task. A set of local features is defined by clustering the graphemes produced by a segmentation procedure. Then a textual based Information Retrieval model is applied. After a first indexation step, this model no longuer requires image access to the database for responding to a specific query, thus making the p...
متن کاملHandwritten Document Analysis for Automatic Writer Recognition
In this paper, we show that both the writer identification and the writer verification tasks can be carried out using local features such as graphemes extracted from the segmentation of cursive handwriting. We thus enlarge the scope of the possible use of these two tasks which have been, up to now, mainly evaluated on script handwritings. A textual based Information Retrieval model is used for ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008